IMPROVEMENTS IN OR RELATING TO SWITCHING DEVICES 

The present invention relates to improvements in or relating to 
switching devices, and is more particularly concerned with a method of 
adjusting bandwidth in such devices. 
5 Traffic volume in the Internet is growing exponentially, doubling 

every three to six months. The current capacity of Internet Protocol (IP) 
routers is insufficient to meet this demand and hence products that can route 
IP traffic at extremely large aggregate bandwidths of the order of lOGbit/s to 
several Tbit/s. Such routers are termed "Super Routers^. 
10 Additionally, there is a growing need to support multicast (one to 

many or many to many) communications within the internet or any other IP 
based network. To support such a service, an IP router must be able to 
replicate packets and send them to multiple outputs on a per packet basis. In 
a router where bandwidth allocations are strictly controlled (in order to 
1 5 support Quality of Service criteria), it is necessary to determine how much 
bandwidth to allocate to multicast traffic across the core switching fabric. 

It is therefore an object of the present invention to provide a method 
which overcomes the problems mentioned above. 

In accordance with one aspect of the present invention, there is 
20 provided a method of allocating bandwidth for multicast traffic in a switching 
device connected between a plurality of ingress means and a plurality of 
egress means, the method comprising the steps of:- 

a) determining ingress multicast queue occupancy for each ingress 

means; 

25 b) determining the number of multicast cells received by the egress 

means from the ingress means in the last bandwidth allocation period; 
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c) determining the bandwidth at each ingress means and egress 
means after real time bandwidth allocations have been taken into account; 
and 

d) calculating the bandwidth allocation for the next bandwidth 

5 allocation period in accordance with the values determined in steps a), b) and 
c)- 

For a better understanding of the present invention, reference will 
now be made, by way of example only, to the accompanying drawings in 
which:- 

10 Figure 1 illustrates an ingress forwarder scheduling function; 

Figure 2 illustrates centralised bandwidth allocation in accordance 
with the present invention; and 

Figure 3 illustrates effective queue lengths for non-real time multicast 
bandwidth allocation. 

1 5 The present invention relates to a method of dynamically adjusting 

the bandwidth, allocated to multicast traffic, across an asynchronous transfer 
mode (ATM) switch or crossbar like switching fabric that joins several IP 
packet forwarder functions to form a "Super Router" node. 

In order to prevent head of line blocking, unicast traffic is queued in 

20 separate logical scheduling entities (called scheduler blocks) according to 
which egress forwarder it is destined. The scheduler block serves a set of 
queues (per class or per connection) via any mechanism desired (e.g. strict 
priority or Weighted Fair Queuing) provided that the real time IP traffic class 
is guaranteed a minimum bandwidth. 

25 However, for multicast traffic, it is not practical to queue traffic on 

the basis of a unique combination of egress destinations. This is because the 
number of queues required becomes unmanageable even for a relatively 
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small number of egress ports. Hence, a separate multicast scheduler block is 
used in each ingress forwarder containing one real time multicast queue and 
one or more non-real time multicast queues as shown in Figure 1 . 

Figure 1 shows an ingress forwarder 100 which includes a unicast 

5 scheduler block 1 1 0 for scheduling unicast traffic and a multicast scheduler 
block 130 for scheduling multicast traffic. Although only one unicast 
scheduler block 1 10 and one multicast scheduler block 130 are shown, it will 
be appreciated that any number of such scheduler blocks may be provided in 
any combination according to a particular application.^ 

10 The unicast scheduler block 110 comprises a plurality of queues 112, 

114, 116 connected to a scheduler 118 which has an output 120 connected to 
a particular egress forwarder (not shown), for example, egress forwarder 1 as 
indicated. Although only three queues 112, 114, 1 16 are shown, it will 
readily be understood that any number of queues may be provided in 

1 5 accordance with a particular application. 

The scheduler 118 has a scheduling rate which is determined by 
unicast bandwidth allocation and operates to transmit cells 122, 124, 126 at 
the head of respective queues 112, 114, 116 according to their priority, as 
indicated by arrow 128, to the output 120. Unicast bandwidth allocation is 

20 described in co-pending British patent application no. 99073 1 3.2 (docket 
number F21558/98P4863), incorporated herein by reference. 

The multicast scheduler block 130 comprises queues 132, 134 - a real 
time queue 132 and a non-real time queue 134. Both queues 132, 134 are 
connected to a scheduler 136 through which all multicast traffic passes. The 

25 scheduler 136 has an output 138. 
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It will readily be appreciated that although only one real time queue 
and one non-real time queue are shown, there may any number of such 
queues depending on the particular application. 

Cells 142, 144 at the head of respective ones of the queues 132, 134 
5 are selected for passage through the scheduler 136 to output 138 in 
accordance with a priority basis as indicated by arrow 146. 

Incoming IP traffic from the line is queued in the relevant queues 
associated with a scheduler block. The choice of scheduler block is governed 
by the destination egress forwarder and whether it is multicast or unicast 
_ 10 traffic. The class of service determines the specific queue to be utilised. 

The overall view of a centralised bandwidth allocation arrangement 
200 is shown in Figure 2. The arrangement 200 comprises a plurality of 
ingress forwarders 210, a plurality of egress forwarders 220, a switching 
network 230 and a bandwidth allocation controller 240. Each ingress 
15 forwarder 212, 214, 216, 218 can be connected to one or more egress 
forwarders 222, 224, 226, 228 as required via the switching network 230 
under the control of the bandwidth allocation controller 240. 

Although only four ingress forwarders 212, 214, 216, 218 and four 
egress forwarders 222, 224, 226, 228 are shown, it will be appreciated that 
20 any number of ingress and egress forwarders can be provided in accordance 
with a particular application. 

As shown in Figure 2, each ingress forwarder 212, 214, 216, 218 
interfaces with the bandwidth allocation controller 240 via links 242 and 244 
- only the links 242, 244 to ingress forwarder 212 being shown for clarity. 
25 Link 242 provides the bandwidth allocation controller 240 with information 
relating to buffer occupancy, arrival rate of packets of cells etc for each 
ingress forwarder 212, 214, 216, 218. Link 244 provides each ingress 



forwarder 212, 214, 216, 218 with scheduling decisions made by the 
bandwidth allocation controller 240. 

Similarly, each egress forwarder 222, 224, 226, 228 interfaces with 
the bandwidth allocation controller 240 via link 246 which provides the 
5 bandwidth allocation controller 240 with information relating to the multicast 
cells sent. Again, only the link 246 with egress forwarder 222 is shown for 
clarity. 

For every fixed period, that is, the Bandwidth Allocation Period 
(BAP), each ingress forwarder 212, 214, 216, 218 sends buffer occupancy 

1 0 (and possibly other information) via link 242 to the central bandwidth 

allocation controller 240. In addition, each egress forwarder 222, 224, 226, 
228 sends information via link 246 on how many multicast cells were 
received in the last BAP from each ingress forwarder 212, 214, 216, 218. 
The bandwidth allocation controller 240 works out the allocation of 

1 5 bandwidth between all ingress/egress forwarder pairs for the next BAP and 
uses this to provide scheduling information to the ingress forwarders 212, 
214, 216, 218 via link 244 telling them which cells/packets to transmit in the 
next cell period. 

However, in order to include multicast functionality into the 

20 bandwidth allocation process some additions are required to the unicast 
algorithm defined in British patent application no. 99073 13.2 mentioned 
above. The unicast bandwidth allocation algorithm essentially divides the 
available bandwidth at ingress and egress amongst competing forwarders 
using the ingress queue length as a weighting factor. The queue length of the 

25 unicast scheduler block for traffic on ingress forwarder i destined for egress 
forwarder j is denoted by q ir Thus, for example, the amount of bandwidth 



allocated to unicast traffic from ingress forwarder i to egress forwarder j at 
the egress, be ip is given by the following equation:- 

bdj = ABWEj * ^^T~ (!) 

k 

Here, ABWEj is the available bandwidth after real time reservations 
5 have been accounted for at the egress forwarder j, and Xq ik is the sum of the 
buffer occupancies for data destined for egress forwarder j in ever ingress 
forwarder. The term q ik represents the buffer occupancy of the scheduler 
block in ingress forwarder i destined for egress forwarder j. 

For real time multicast flows, the fan-out and bandwidth guarantees 
10 are known in advance and the sum of all ingress and egress requirements can 
be subtracted from the available bandwidth in the same way as for real time 
unicast traffic flows. 

As the amount of egress bandwidth required for non-real time 
multicast flows is not known (compared with the case for real time 
15 multicast), it must be determined by the system. One way of determining the 
amount of egress bandwidth required is to collect statistics at the egress 
forwarders on the number of multicast cells received from each ingress 
forwarder in the last BAP. These statistics can then be included in the queue 
statistics message sent from the ingress forwarder to the central scheduler 
20 every BAP. 

Although Figures 1 and 2 have been described with reference to 
ingress and egress forwarders, it will be appreciated that the ingress and 
egress devices are not limited to such devices and may comprise any suitable 
device which enables packets of data to be transferred from one side of a 
25 switching network to another. 
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Figure 3 illustrates a system 300 for calculating non-real time 
multicast bandwidth allocation. The system 300 comprises a plurality of 
ingress forwarders or line interface cards (LICs) 3 10, a plurality of egress 
forwarders or line interface cards (LICs) 320, and a switching network 330. 
5 Each ingress LIC 3 1 2, 3 14, 3 1 6 has at least one queue 342, 344, 346 as 
shown. Each egress LIC 322, 324, 326 receives data from one or more 
ingress LICs 3 12, 3 14, 3 1 6 across the switching network 330 as shown. Only 
one queue 342, 344, 346 is shown in each ingress LIC 3 12, 3 14, 316 for 
clarity. 

10 The ingress forwarder multicast queue occupancy is denoted as mcq ( 

for ingress forwarder i. The number of multicast cells received by egress 
forwarder j from ingress forwarder i in the last BAP is denoted by mcq^. The 
bandwidth allocated to non-real time multicast flows from ingress forwarder i 
to egress forwarder j is denoted by mcb ir 

15 The value of mcqt is used in the ingress bandwidth fair share in the 

same manner as q y does in the unicast centralised bandwidth allocation 
algorithm. 

The values mcq^ take part in the egress fair share allocation by 
providing a proportion with which to scale the ingress multicast queue 
20 occupancies. This means that the effective weight that the occupancy of the 
ingress non-real time (nrt) multicast queue (mcq^ has on an egress forwarder 
j (called emcq^ is determined by the proportion of nrt cells received by 
egress forwarder j compared to all those received at egress forwarder j in the 
last BAP period. It is therefore governed by the following equation: - 

25 emcq.. = mcq i * ^ Hl ' (2) 



The value of emcq^ will be used in egress bandwidth allocation 
functions alongside the unicast equivalents q i} . 

Thus, the equivalent of equation (1) when including the multicast 
traffic is given in equation (3):- 

emcq.. 

bme 9 = MWEj* (3) 

where bme^ is the rut multicast bandwidth allocated between ingress 
forwarder i and egress forwarder j, ABWE p emcq^ and q ik are the same as 
before. 

Similar principles can be applied at the ingress for bandwidth 
allocation and any left over bandwidth can be distributed between unicast and 
multicast allocations by any fair mechanism required. 

The ingress equation for nrt multicast bandwidth becomes:- 

bmi g = Mwi* in ^\ (4) 

k k 

ABWIj is the available bandwidth at the ingress forwarder i after real 
time traffic reservations have been taken into account. The term imcq i} is the 
ingress equivalent of emcqy and is the effective weight with which to scale 
the ingress multicast queue occupancy. It is calculated from equation (5). 

mcqy 



imcq.=mcq*^- 



(5) 



The actual allocated multicast bandwidth between ingress forwarder i 
and egress forwarder j is the minimum of the ingress and egress bandwidth 
allocations as defined in equation (6). 

bmy =va ^bmi 9 bme,) (6) 



Any remaining bandwidth not allocated after this process has been 
carried out can be distributed between unicast and multicast allocations in 
any fair manner desired. 

The advantages of the scheme described above include: 

a) A 100% efficient distribution of bandwidth 

b) A fair distribution of nrt bandwidth according to ingress queue 
occupancies 

c) Prevention of overload of the cross-bar by restricting the total 
amount of bandwidth allocated at ingress and egress ports to the 
cross-bar. 



